Crowd Counting Via Perspective-Guided Fractional-Dilation Convolution

نویسندگان

چکیده

Crowd counting is critical for numerous video surveillance scenarios. One of the main issues in this task how to handle dramatic scale variations pedestrians caused by perspective effect. To address issue, paper proposes a novel convolution neural network-based crowd method, termed Perspective-guided Fractional-Dilation Network (PFDNet). By modeling continuous variations, proposed PFDNet able select proper fractional dilation kernels adapting different spatial locations. It significantly improves flexibility state-of-the-arts that only consider discrete representative scales. In addition, avoiding multi-scale or multi-column architecture used other methods, it computationally more efficient. practice, constructed stacking multiple Convolutions (PFC) on VGG16-BN backbone. introducing generalized operation, PFC can ratios domain under guidance annotations, achieving scales pedestrians. deal with problem unavailable information some cases, we further introduce an effective estimation branch PFDNet, which be trained either supervised weakly-supervised setting once has been pre-trained. Extensive experiments show outperforms state-of-the-art methods ShanghaiTech A, B, WorldExpo'10, UCF-QNRF, UCF_CC_50 and TRANCOS dataset, MAE 53.8, 6.5, 6.8, 84.3, 205.8, 3.06 respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Depth Information Guided Crowd Counting for Complex Crowd Scenes

It is important to monitor and analyze crowd events for the sake of city safety. In an EDOF (extended depth of field) image with a crowded scene, the distribution of people is highly imbalanced. People far away from the camera look much smaller and often occlude each other heavily, while people close to the camera look larger. In such a case, it is difficult to accurately estimate the number of...

متن کامل

Crowd counting via scale-adaptive convolutional neural network

The task of crowd counting is to automatically estimate the pedestrian number in crowd images. To cope with the scale and perspective changes that commonly exist in crowd images, state-of-the-art approaches employ multi-column CNN architectures to regress density maps of crowd images. Multiple columns have different receptive fields corresponding to pedestrians (heads) of different scales. We i...

متن کامل

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

Crowd counting is an important task in computer vision, which has many applications in video surveillance. Although the regression-based framework has achieved great improvements for crowd counting, how to improve the discriminative power of image representation is still an open problem. Conventional holistic features used in crowd counting often fail to capture semantic attributes and spatial ...

متن کامل

Unsupervised Crowd Counting

Most crowd counting methods rely on training with labeled data to learn a mapping between image features and the number of people in the scene. However, the nature of this mapping may change as a function of the scene, camera parameters, illumination etc., limiting the ability of such supervised systems to generalize to novel conditions. Here we propose an alternative, unsupervised strategy. Th...

متن کامل

Counting with the Crowd

In this paper, we address the problem of selectivity estimation in a crowdsourced database. Specifically, we develop several techniques for using workers on a crowdsourcing platform like Amazon’s Mechanical Turk to estimate the fraction of items in a dataset (e.g., a collection of photos) that satisfy some property or predicate (e.g., photos of trees). We do this without explicitly iterating th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2022

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2021.3086709